# Semantic similarity calculation
Medical Embedded V4
Apache-2.0
This is a multilingual sentence embedding model that can map sentences and paragraphs to a 768-dimensional vector space, suitable for tasks such as clustering and semantic search.
Text Embedding Supports Multiple Languages
shtilev
202
1
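Most bi-encoder entries in this list, including this one, are used the same way: encode sentences into vectors, then compare the vectors with cosine similarity. Below is a minimal sketch assuming the model loads through the sentence-transformers library, as its description suggests; the model ID is a guess inferred from the display name and should be checked against the hub.

```python
from sentence_transformers import SentenceTransformer, util

# Hypothetical model ID inferred from the display name; verify the exact path on the hub.
model = SentenceTransformer("shtilev/medical_embedded_v4")

sentences = [
    "The patient reports chest pain and shortness of breath.",
    "Le patient se plaint de douleurs thoraciques.",
    "The weather is sunny today.",
]

# Encode into 768-dimensional vectors and compare every pair with cosine similarity.
embeddings = model.encode(sentences, normalize_embeddings=True)
print(util.cos_sim(embeddings, embeddings))
```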
Langcache Crossencoder V1 Ms Marco MiniLM L12 V2
Apache-2.0
A Transformer-based CrossEncoder fine-tuned on the Quora question pairs dataset; it computes a score for each text pair and is suited to semantic similarity and semantic search tasks.
Text Classification English
aditeyabaral-redis
281
0
Langcache Crossencoder V1 Ms Marco MiniLM L6 V2
Apache-2.0
This is a model based on the Cross Encoder architecture, specifically designed for text pair classification tasks. It is fine-tuned on the Quora question pair dataset and is suitable for semantic similarity judgment and semantic search scenarios.
Text Classification English
aditeyabaral-redis
338
0
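The two Langcache CrossEncoder entries above work differently from the bi-encoders elsewhere in this list: a cross-encoder reads both texts of a pair jointly and returns a single score rather than separate embeddings. A minimal sketch using the sentence-transformers CrossEncoder class; the model ID is hypothetical, inferred from the display name.

```python
from sentence_transformers import CrossEncoder

# Hypothetical model ID inferred from the display name; verify the exact path on the hub.
model = CrossEncoder("aditeyabaral-redis/Langcache-Crossencoder-v1-ms-marco-MiniLM-L6-v2")

pairs = [
    ("How do I reset my password?", "What are the steps to change my account password?"),
    ("How do I reset my password?", "What is the capital of France?"),
]

# Each (text, text) pair receives one score; higher means more similar/relevant.
scores = model.predict(pairs)
print(scores)
```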
Langcache Embed V2
A sentence transformer model fine-tuned from Redis Langcache Embed v1, used to generate 768-dimensional sentence embedding vectors.
Text Embedding
redis
126
1
Dragonkue KoEn E5 Tiny
Apache-2.0
This is a sentence-transformers model fine-tuned from intfloat/multilingual-e5-small, trained with Korean query-passage pairs to enhance performance in Korean retrieval tasks.
Text Embedding Supports Multiple Languages
exp-models
607
5
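Models derived from the multilingual-e5 family are normally queried with "query:" and "passage:" prefixes; whether this Korean fine-tune keeps that convention should be confirmed on its model card. A sketch under that assumption, with a hypothetical model ID:

```python
from sentence_transformers import SentenceTransformer, util

# Hypothetical model ID; the E5-style prefixes are an assumption carried over from the base model.
model = SentenceTransformer("exp-models/dragonkue-KoEn-E5-Tiny")

queries = ["query: 한국의 수도는 어디인가요?"]
passages = [
    "passage: 서울은 대한민국의 수도이다.",
    "passage: 파리는 프랑스의 수도이다.",
]

q_emb = model.encode(queries, normalize_embeddings=True)
p_emb = model.encode(passages, normalize_embeddings=True)
print(util.cos_sim(q_emb, p_emb))  # the matching passage should score highest
```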
All MiniLM L2 V2
Apache-2.0
This model is distilled from all-MiniLM-L12-v2, achieving nearly 2x faster inference speed while maintaining high accuracy on both CPU and GPU.
Text Embedding Supports Multiple Languages
tabularisai
5,063
2
Snowflake Arctic Embed L V2.0 Ko
Apache-2.0
This is a SentenceTransformer model fine-tuned from Snowflake/snowflake-arctic-embed-l-v2.0, trained on a clustering dataset. It maps sentences and paragraphs into a 1024-dimensional dense vector space, suitable for semantic text similarity and semantic search.
Text Embedding Supports Multiple Languages
dragonkue
4,964
26
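Since the model emits one 1024-dimensional vector per sentence, the clustering use case mentioned above reduces to running an ordinary clustering algorithm over those vectors. A sketch with scikit-learn's KMeans; the model ID is hypothetical, inferred from the display name.

```python
from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans

# Hypothetical model ID inferred from the display name; verify the exact path on the hub.
model = SentenceTransformer("dragonkue/snowflake-arctic-embed-l-v2.0-ko")

sentences = [
    "주식 시장이 오늘 급등했다.",
    "코스피 지수가 크게 상승했다.",
    "새로운 스마트폰이 출시되었다.",
    "최신 휴대폰 모델이 공개되었다.",
]

# 1024-dimensional embeddings; sentences about the same topic should land in the same cluster.
embeddings = model.encode(sentences, normalize_embeddings=True)
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(embeddings)
print(labels)
```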
Jina Embeddings V3
Jina Embeddings V3 is a multilingual sentence embedding model supporting over 100 languages, focusing on sentence similarity calculation and feature extraction tasks.
Text Embedding
Transformers Supports Multiple Languages
Daxtra
55
1
Context Skill Extraction Base
This is a sentence-transformers model that maps sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as semantic text similarity calculation and semantic search.
Text Embedding
TechWolf
189
5
Gte Base Ko
A sentence embedding model fine-tuned from Alibaba-NLP/gte-multilingual-base on a Korean triplet dataset for semantic similarity calculation.
Text Embedding Supports Multiple Languages
juyoungml
18
2
Mind Map Blog Model
This is a sentence transformer model fine-tuned from sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2, which can map text to a 384-dimensional vector space for tasks such as semantic similarity calculation.
Text Embedding
hothanhtienqb
463
2
Stella En 400M V5 Cpu
MIT
stella_en_400M_v5_cpu is a model that performs strongly across multiple natural language processing tasks, particularly classification, retrieval, clustering, and semantic textual similarity.
Text Embedding
biggunnyso4
612
1
Gte Base Korean
Apache-2.0
A Korean sentence embedding model fine-tuned on Alibaba-NLP/gte-multilingual-base, supporting tasks such as semantic text similarity calculation and semantic search.
Text Embedding
upskyy
1,436
4
E5 All Nli Triplet Matryoshka
This is a sentence-transformers model fine-tuned from intfloat/multilingual-e5-small, designed to map sentences and paragraphs into a 384-dimensional dense vector space, supporting tasks such as semantic text similarity and semantic search.
Text Embedding
Omartificial-Intelligence-Space
14
2
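The "Matryoshka" in the name suggests the model was trained so that the leading dimensions of each embedding stay informative on their own, which lets vectors be truncated for cheaper storage and search. The sketch below illustrates that idea; it assumes the checkpoint really was trained with a Matryoshka objective, and the model ID is a guess inferred from the display name.

```python
import numpy as np
from sentence_transformers import SentenceTransformer

# Hypothetical model ID inferred from the display name; verify the exact path on the hub.
model = SentenceTransformer("Omartificial-Intelligence-Space/E5-all-nli-triplet-Matryoshka")

sentences = ["A man is playing guitar.", "Someone is playing a musical instrument."]
full = model.encode(sentences)  # 384-dimensional vectors

# Keep only the first 128 dimensions, then re-normalize before computing cosine similarity.
truncated = full[:, :128]
truncated = truncated / np.linalg.norm(truncated, axis=1, keepdims=True)
print(truncated @ truncated.T)
```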
USER Bge M3
Apache-2.0
A Russian universal sentence encoder, based on the sentence-transformers framework, specifically designed to extract 1024-dimensional dense vectors for Russian text.
Text Embedding Other
deepvk
339.46k
58
Labse Ru Turbo
MIT
A BERT model for computing Russian sentence embeddings, developed from cointegrated/LaBSE-en-ru with optimized Russian-language processing performance.
Text Embedding
Transformers Other
sergeyzh
3,987
15
Jina Embeddings V2 Base Zh
Apache-2.0
Jina Embeddings V2 Base is a sentence embedding model optimized for Chinese, which can convert text into high-dimensional vector representations for calculating sentence similarity and feature extraction.
Text Embedding Supports Multiple Languages
silverjam
63
1
Llm2vec Meta Llama 3 8B Instruct Mntp Supervised
MIT
LLM2Vec is a supervised learning model based on Meta-Llama-3, focusing on natural language processing tasks such as sentence similarity, and supporting various application scenarios such as text embedding, information retrieval, and text classification.
Large Language Model English
McGill-NLP
5,530
49
Sentence Transformers Multilingual E5 Small
MIT
multilingual-e5-small is a model that performs strongly on multilingual text processing tasks, supporting classification, retrieval, clustering, re-ranking, and semantic text similarity.
Text Embedding Supports Multiple Languages
beademiguelperez
3,922
1
Ruropebert Classic Base 512
A Russian encoder model based on the RoPEBert architecture, trained using cloning methods; it supports a 512-token context and surpasses the original ruBert-base model in quality.
Large Language Model
Transformers Other
Tochka-AI
103
1
Sambert
This is a Hebrew embedding model based on sentence-transformers, capable of mapping sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as clustering or semantic search.
Text Embedding
Transformers Other
MPA
149
2
Multi Sentence BERTino
MIT
This is a sentence transformer model based on BERTino, capable of mapping Italian sentences and paragraphs into a 768-dimensional dense vector space, suitable for tasks such as clustering or semantic search.
Text Embedding
Transformers Other
nickprock
63.88k
5
Simcse Roberta Large Zh
MIT
SimCSE (sup) is a model for Chinese sentence similarity tasks. It encodes sentences into embedding vectors and computes the cosine similarity between sentences.
Text Embedding
Transformers Chinese
hellonlp
179
1
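The cosine similarity referred to in the SimCSE entry above is simply the dot product of two embeddings divided by the product of their norms. A minimal numpy illustration with placeholder vectors (not actual model output):

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    # cos(a, b) = (a · b) / (||a|| * ||b||)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

a = np.array([0.2, 0.7, 0.1])  # placeholder embedding
b = np.array([0.3, 0.6, 0.2])  # placeholder embedding
print(cosine_similarity(a, b))  # values near 1.0 indicate semantically similar sentences
```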
Klue Roberta Base Klue Sts
This is a model based on sentence-transformers that can map sentences and paragraphs to a 768-dimensional dense vector space, suitable for tasks such as clustering and semantic search.
Text Embedding
shangrilar
165
0
Robbert 2022 Dutch Sentence Transformers Onnx
An ONNX version of the Dutch sentence transformer based on the RobBERT model, mapping text to a 768-dimensional vector space, suitable for semantic search and clustering tasks.
Text Embedding
Transformers Other
Todai
30
1
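Recent sentence-transformers releases (3.2 and later) can load ONNX exports directly through a backend argument, which is one plausible way to use this ONNX repackaging. The sketch below assumes that version, that optimum and onnxruntime are installed, and that the repository actually ships an ONNX export; the model ID is again inferred from the display name.

```python
from sentence_transformers import SentenceTransformer, util

# Assumes sentence-transformers >= 3.2 with the ONNX backend (requires optimum and onnxruntime);
# the model ID is hypothetical, inferred from the display name.
model = SentenceTransformer("Todai/robbert-2022-dutch-sentence-transformers-onnx", backend="onnx")

zinnen = [
    "Amsterdam is de hoofdstad van Nederland.",
    "De hoofdstad van Nederland is Amsterdam.",
]
embeddings = model.encode(zinnen, normalize_embeddings=True)
print(util.cos_sim(embeddings, embeddings))
```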
Sentence Transformers Alephbertgimmel Small
This is a Hebrew sentence similarity model based on sentence-transformers, which maps text to a 512-dimensional vector space for semantic search and clustering tasks.
Text Embedding
Transformers Other
imvladikon
39
1
Sup Simcse Ja Large
This is a Japanese sentence embedding model trained using the supervised SimCSE method, specifically designed for generating high-quality sentence representations.
Text Embedding
Transformers Japanese
cl-nagoya
2,315
14
Bge Base En V1.5 Ct2
MIT
BGE Base English v1.5 is a transformer-based sentence embedding model, specifically designed for extracting sentence features and calculating sentence similarity.
Text Embedding
Transformers English
winstxnhdw
30
0
E5 Base En Ru
MIT
This is a vocabulary-pruned version of intfloat/multilingual-e5-base, retaining only English and Russian vocabulary.
Text Embedding
Transformers Supports Multiple Languages
d0rj
733
8
Sentence Transformers Gte Large
This is a sentence embedding model based on sentence-transformers, capable of converting text into 1024-dimensional dense vector representations, suitable for tasks like semantic search and text clustering.
Text Embedding
embaas
106
1
Distiluse Base Multilingual Cased V2
Apache-2.0
This is a multilingual sentence embedding model that maps text to a 512-dimensional vector space, suitable for semantic search and clustering tasks.
Text Embedding
Transformers Other
lorenpe2
32
0
Indosbert Large
indoSBERT-large is an Indonesian sentence embedding model based on sentence-transformers, which maps sentences and paragraphs into a 256-dimensional vector space, suitable for tasks such as clustering and semantic search.
Text Embedding Other
denaya
510
13
Compositional Bert Large Uncased
Apache-2.0
CompCSE and SimCSE are contrastive learning-based sentence embedding models for calculating sentence similarity.
Text Embedding
Transformers English
perceptiveshawty
754
2
Text2vec Base Chinese Sentence
Apache-2.0
A Chinese sentence embedding model based on the CoSENT (Cosine Sentence) model, mapping sentences to a 768-dimensional dense vector space, suitable for tasks such as sentence embedding, text matching, or semantic search.
Text Embedding
Transformers Chinese
shibing624
1,895
54
Xl Lexeme
A model based on sentence-transformers for mapping target words in sentences to a 1024-dimensional vector space, supporting word similarity calculation and semantic search tasks.
Text Embedding
Transformers
pierluigic
1,350
1
Abstract Sim Query
A model that maps abstract sentence descriptions to matching sentences, trained on Wikipedia using a dual-encoder architecture.
Text Embedding
Transformers English
biu-nlp
53
12
Congen WangchanBERT Small
Apache-2.0
This is a sentence embedding model based on the ConGen framework, capable of mapping sentences to a 128-dimensional dense vector space, suitable for tasks such as semantic search.
Text Embedding
Transformers
kornwtp
812
0
Sentence Transformers Paraphrase Multilingual Mpnet Base V2
Apache-2.0
A multilingual sentence embedding model that maps text to a 768-dimensional vector space, suitable for semantic search and clustering tasks.
Text Embedding
Transformers
tgsc
17
1
Polish Sts V2
This is a Polish-language sentence embedding model capable of mapping sentences and paragraphs into a 1024-dimensional vector space, suitable for semantic search and clustering tasks.
Text Embedding
Transformers Other
radlab
43
2
Sentence Transformer Ult5 Pt Small
A sentence transformer model based on ult5-pt-small that maps sentences and paragraphs into 512-dimensional vectors, suitable for tasks like text clustering, similarity calculation, and semantic search.
Text Embedding
Transformers
tgsc
358
2